# Low CER Optimization

Wav2vec2 Large Chinese Zh Cn
Apache-2.0
A Chinese speech recognition model fine-tuned from the XLSR-53 large model; expects audio input sampled at 16 kHz.
Speech Recognition Transformers Chinese
wbbbbb
585
40
Wav2vec2 Xls R 300m Zh HK Lm V2
Apache-2.0
An automatic speech recognition model based on the XLS-R architecture, optimized for Cantonese (zh-HK), fine-tuned on the Common Voice dataset and enhanced with a 5-gram language model.
Speech Recognition Transformers
w11wo
25
0
Wav2vec2 Large Xlsr 53 Chinese Zh Cn
Apache-2.0
A Chinese speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53; expects audio input sampled at 16 kHz.
Speech Recognition Chinese
jonatasgrosman
3.8M
110
Wav2vec2 Large Xlsr Japanese
Apache-2.0
A fine-tuned model based on facebook/wav2vec2-large-xlsr-53 for Japanese speech recognition tasks.
Speech Recognition Transformers Japanese
vumichien
214
5
Wav2vec2 Xls R 300m Korean
Apache-2.0
A Korean automatic speech recognition model based on the XLS-R architecture, fine-tuned on the Zeroth Korean dataset.
Speech Recognition Transformers Korean
w11wo
152
6
Wav2vec2 Xls R 300m Japanese
Apache-2.0
A Japanese automatic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m, designed specifically to transcribe Japanese audio into Hiragana text.
Speech Recognition Transformers Japanese
vitouphy
29
0
W2v Hf Jsut Xlsr53
Apache-2.0
A Japanese automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 using the Common Voice and JSUT datasets.
Speech Recognition Transformers Japanese
qqpann
16
1
Wav2vec2 Large Xlsr 53 Tw Gpt
Apache-2.0
A speech recognition model for Taiwan Mandarin (zh-tw), fine-tuned from facebook/wav2vec2-large-xlsr-53; expects audio input sampled at 16 kHz.
Speech Recognition Transformers
voidful
47
3
Wav2vec2 Xls R 300m Korean Lm
Apache-2.0
A Korean automatic speech recognition model based on the XLS-R architecture, fine-tuned on the Zeroth Korean dataset and enhanced with a 5-gram language model.
Speech Recognition Transformers Korean
w11wo
23
1
Wav2vec2 Xls R 300m German De
Apache-2.0
A German automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-xls-r-300m on the MOZILLA-FOUNDATION/COMMON_VOICE_7_0 - DE dataset.
Speech Recognition Transformers German
AndrewMcDowell
72
3
Wav2vec2 Xls R 300m Japanese
Apache-2.0
A Japanese automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-xls-r-300m on the Japanese Common Voice 8.0 dataset.
Speech Recognition Transformers Japanese
AndrewMcDowell
24
0
Wav2vec2 Large Japanese
A Japanese speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53; expects audio input sampled at 16 kHz.
Speech Recognition Japanese
NTQAI
316
7
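The models above are grouped under low CER. Assuming CER here means character error rate (the metric commonly reported for Chinese, Japanese, and Korean speech recognition), it can be sketched as the character-level edit distance between a reference transcript and a model's output, divided by the reference length. The function names `edit_distance` and `cer` are hypothetical helpers for illustration, not part of any listed model's code.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two character sequences."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference character
                curr[j - 1] + 1,     # insertion of a hypothesis character
                prev[j - 1] + cost,  # substitution, or match when cost is 0
            ))
        prev = curr
    return prev[-1]


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: edit distance divided by reference length."""
    if not reference:
        raise ValueError("reference must be non-empty")
    return edit_distance(reference, hypothesis) / len(reference)
```

A lower value means the transcript is closer to the reference. Evaluation scripts often normalize text first (for example, by stripping punctuation), which this sketch leaves out.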